Audio-Visual Identity Grounding for Enabling Cross Media Search
Abstract
Automatically searching for media clips in large heterogeneous datasets is inherently difficult, and nearly impossible when the search crosses distinct media types (e.g., finding audio clips that match an image). In this paper we introduce identity grounding as a means of enabling this cross-media search and exploration capability. Grounding lets us use one media channel (e.g., visual identity) as a noisy label for training a model in a different channel (e.g., an audio speaker model). We demonstrate this search capability by using images from the Labeled Faces in the Wild (LFW) dataset to query audio files extracted from the YouTube Faces (YTF) dataset.
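The grounding idea in the abstract can be illustrated with a toy sketch. This is not the paper's method: the abstract does not specify the speaker model, the features, or the scoring, so everything here is an assumption made for illustration. It assumes each video clip already has a face-derived identity label (possibly wrong) and a fixed-length audio feature vector, it stands in a per-identity mean vector for the audio speaker model, and it ranks audio clips by cosine similarity. The names `train_speaker_centroids`, `search_audio`, and all data values are hypothetical.

```python
from collections import defaultdict
from math import sqrt


def train_speaker_centroids(clips):
    """Average audio features per face-derived label.

    clips: iterable of (face_label, audio_features). The face label is a
    noisy stand-in for the speaker identity (assumption: one label per clip).
    """
    sums = {}
    counts = defaultdict(int)
    for label, feats in clips:
        if label not in sums:
            sums[label] = [0.0] * len(feats)
        sums[label] = [s + f for s, f in zip(sums[label], feats)]
        counts[label] += 1
    return {
        label: [s / counts[label] for s in total]
        for label, total in sums.items()
    }


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def search_audio(query_identity, centroids, audio_index):
    """Rank audio clip ids by similarity to the queried identity's model.

    query_identity is assumed to come from matching the query image to a
    known face identity; that matching step is outside this sketch.
    """
    model = centroids[query_identity]
    scored = [
        (cosine(model, feats), clip_id)
        for clip_id, feats in audio_index.items()
    ]
    return [clip_id for _, clip_id in sorted(scored, reverse=True)]


# Hypothetical training clips; the last "alice" clip is deliberately
# mislabeled to mimic noise in the visual channel.
training_clips = [
    ("alice", [1.0, 0.1]),
    ("alice", [0.9, 0.2]),
    ("bob", [0.1, 1.0]),
    ("alice", [0.2, 0.9]),
]
centroids = train_speaker_centroids(training_clips)

# Hypothetical audio-only collection to search.
audio_index = {"clip_a": [0.95, 0.15], "clip_b": [0.05, 0.9]}
alice_ranking = search_audio("alice", centroids, audio_index)
bob_ranking = search_audio("bob", centroids, audio_index)
```

In this toy setup the single mislabeled clip pulls the "alice" centroid toward the other speaker but does not change the ranking, which is the sense in which a noisy label from one channel can still be useful for training a model in another.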